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TO ALL WHOM IT MAY CONCERN: 

Be it known that we, Yishay Mansour, a citizen of Israel, residing 
at 9 Ha'avoda Street, Apt. 24, Tel Aviv 63821, Israel, and Alexander 
Kesselman, a citizen of Israel, residing at 36/8 Ha'am Ha'tzarfati Street, Ramat 
Gan 52572, Israel, have invented a new and useful Harmonic Buffer 
Management Policy for Shared Memory Switches, of which the following is a 
specification. 
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BUFFER MANAGEMENT POLICY 
FOR SHARED MEMORY SWITCHES 

Field of the Invention 

This invention relates generally to buffer management policies, and 
specifically to buffer management for shared memory switches. 

Background of the Invention 

Because each node of a packet network typically receives packets from a 
multiplicity of other nodes and must re-direct these packets to a multiplicity of 
destinations (i,e. other nodes), the nodes of a network must be equipped with means 
for accomplishing such receipt and re-direct. Furthermore, because traffic in a 
network flows constantly, such means must be capable of performing both the receipt 
and the re-direct fimctions simultaneously. 

The receiving and re-directing functions required of a node (also known 
together as the routing functions) are performed by a device called a switch. As each 
node has a multiplicity of input and output links, the prime purpose of the switch is to 
route incoming packets arriving on a particular input link to the proper output link, A 
number of different techniques have been proposed for carrying out the switching 
(routing) fiinction. The present invention is concerned with improving one of these 
techniques, the technique known as shared memory. 

A shared memory switch consists of a single memory having multiple input 
and output lines. Fig. 1 depicts schematically a typical shared memory switch 10, in 
which the input lines of the switch, denoted generally as 12, are all connected to a 
single input port 14, and the output lines, denoted generally as 16, are connected to a 
single output port 18. It will be appreciated that although in Fig. 1 N input lines and 
N output lines are depicted, the number of input and output lines need not be equal. 
At given time intervals, packets arriving on the input lines (the first packet on each 
line at each time interval) are multiplexed into a single stream which is then fed 
through the single input port into the memory for storage. Once inside the memory, 
packets are organized into separate output queues, one queue for each output line. 
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The processes and apparatus controlling the storage of packets and their organization 
into queues are depicted schematically in Fig. 1 as controller 20, which controls the 
writing of multiplexed packets into the memory (WA) and the reading of multiplexed 
packets from the memory (RA). To send packets to their correct destinations, the 
process is reversed: at each time interval, an output stream of packets is formed by 
sequentially retrieving packets from the output queues (the first packet in each queue 
at each time interval), and this output stream is then de-multiplexed and the packets 
transmitted on the output lines. The multiplexing and de-multiplexing of packets is 
depicted schematically in Fig. 1 as S/P (serial-to-parallel) and P/S (parallel to serial) 
respectively. 

In practice, at any given time a shared memory switch is engaged in all of the 
input and output processes (multiplexing, storing, organizing into queues and 
de-multiplexing): at set time intervals, packets at the heads of non-empty queues are 
sent to the corresponding output link, and incoming packets are sent to their assigned 
queues. For purposes of discussion throughout the remainder of this patent 
application, the Applicants will relate to input and output lines of the shared memory 
switch as if these lines were coimected directly to the switch and the 
multiplexing/de-multiplexing step was unnecessary. 

Because multiple cells arriving at the same time on different input lines may 
be destined for the same output line, switches must also be capable of buffering 
(sometimes referred to as queuing). While in theory a switch may buffer cells at the 
input or the output of the switch, in shared memory switches a third possibility is 
available, viz. to buffer the cells internally. Shared memory switches have gained 
popularity because, by appropriately controlling the memory read/write, the switching 
and buffering functions of the shared memory switch may be achieved 
simultaneously. This feature imparts significant advantages to shared memory 
switches: such switches do not suffer from throughput degradation (a drop-off in the 
total number of cells passing through the switch) due to "head-of-line blocking'', a 
phenomenon inherent in input-buffered switches. 
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Buffer allocation determines how the total buffer space, i.e. the memory, will 
be used by individual output lines of the shared memory switch. The buffer allocation 
is in turned determined by implementation of a buffer management policy. Several 
such buffer management policies are known in the art. Known buffer management 
policies are based on a model of a shared memory switch having N output lines (and 
usually N input lines) and a total buffer space of M equally sized packets. Most of 
these policies operate on a non-preemptive basis, i.e. once a packet has been queued, 
it will not be removed from a queue, and therefore these policies involve determining 
whether an incoming packet will be accepted or rejected. (Packets which have been 
queued are sent to their destinations on a first-in-first-out (FIFO) basis.) For the sake 
of completeness, however, it is noted that some buffer management policies have 
been proposed which follow a preemptive policy. These buffer management policies 
are known as "push-out" or "drop-on-demand" and variations thereof, and involve 
"pushing out" an already queued packet under certain conditions. Because the 
"push-out" step is difficult to implement in practice at the speeds imder which modem 
packet switches must operate, "push-out" and its variants have not been widely 
adopted commercially. 

Of the non-preemptive buffer allocation policies, the two simplest policies are 
known as complete partitioning and complete sharing. In complete partitioning (CP), 
the entire buffer space is permanently partitioned among the N output lines. Although 
in principle each queue could be assigned a different amount of space, in practice 
under CP each queue is allocated an equal amount of bxaffer space, M/N packets for 
each of the N queues. Since the sum of individual output line allocations is equal to 
the total buffer memory M, complete partitioning does not provide any sharing. This 
can represented mathematically as follows: let M be the total buffer space (memory) 
in a shared memory switch having N output lines and N corresponding output queues, 
let Tq be the buffer space allocated to queue q, and let Lq be the buffer space actually 
occupied by queue q. In complete partitioning, 

Lq<Tq (1) 

for all q (q = 1 to N) and 
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ETq<M (2) 

(Implicitly, this also means that the sum of buffer space occupied by all 
queues camiot exceed the total amoimt of memory available, i.e. 

f,U<M (3) 

but this condition is true by definition for the shared memory switch, irrespective of 
the buffer allocation policy implemented.) 

The advantage of complete partitioning is that the determination to accept or 
reject a packet is easy to make: if the buffer allocation for the queue to which an 
incoming packet is assigned is already full, the incoming packet is rejected; if the 
buffer allocation for the queue to which the incoming packet is assigned is not full, 
the packet is accepted, CP also ensures that the memory is allocated fairly among the 
output queues. The disadvantage of a complete partitioning policy is that the total 
memory is often xmder-utilized, since unused memory from an inactive queue carmot 
be utilized by a highly active queue temporarily requiring additional memory. 

At the other extreme there is the complete sharing (CS) policy, in which all 
packets destined for all queues are accepted until the memory is completely fiill. 
Using the same notation as above, this can be expressed by adherence to condition 
(3), without adherence to conditions (1) or (2). While complete sharing ensures that 
the memory space will never be under-utilized (if necessary, the entire buffer space 
may be used), it also enables a single highly active queue to monopolize the buffer. 
This is unfair and can be detrimental to performance under some circumstances. For 
example, if the highly active queue contains packets which are part of an (extremely 
large) e-mail message, for which a delay of several seconds in transmission will not 
be critical, and other packets which are part of a real-time voice transmission are 
consequently rejected by the buffer, the recipient of the voice transmission will 
receive a garbled transmission. This intuitive insight has been borne out by computer 
modeling and simulation: under balanced traffic conditions, CS has been shown to 
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outperform CP (in the sense that fewer packets are lost in CS), but under imbalanced 
traffic conditions, which more closely mimic traffic conditions in actual ISDN 
networks, CP outperforms CS. 

To try to overcome the drawbacks of complete sharing and complete 
partitioning, but to still take advantage of the benefits of each of these buffer 
management policies, several hybrid buffer management policies have been proposed. 
One such policy is known as sharing with maximum queue lengths (SMXQ). In this 
policy, each output queue is allocated a finite amount of buffer space. Although in 
principle the allocation for each queue may vary, in practice each queue is typically 
allocated the same amount of buffer space. In this respect, SMXQ is similar to CP. 
However, unlike in CP, the total buffer allocations may together exceed the total 
memory space M. Furthermore, unlike in CP (but as in CS), sharing between output 
queues is allowed. SMXQ can thus be expressed mathematically by adherence to the 
conditions of equations (1) and (3) above, but the inequality in (2) is reversed to yield 
condition (2') which must also be adhered to: 

ETq>M (2') 

The result of these conditions is that no single queue is allowed to monopolize 
the total buffer space, but by the same token a queue in need of additional space is 
able to utilize some of the unused space allocated to another queue. The degree of 
sharing allowed will be determined by the maximum queue length allowed: the closer 
the maximum queue length is to M/N (or more generally, the closer the total 
allocation for all queues approaches M), the more closely the policy approaches CP, 
and the closer the maximum queue length is to M, the more closely the policy 
approaches CS. Thus in SMXQ the maximum queue length is set as less than M but 
greater than M/N. Computer modeling and simulation has shown that under 
imbalanced traffic conditions, SMXQ generally outperforms both CS and CP. 

It should be noted that to the best of the inventors' knowledge, there is at 
present no optimal solution for choosing the degree of sharing vmder imbalanced 



5 



41682s4.doc 



41682S4 



traffic conditions. The chief disadvantage of SMXQ is that if a situation arises where 
there are many active output Unes, the total buffer space taken by these active Hnes 
may leave some other (lightly loaded) output line with insufficient space. This can be 
a problem if it is desirable to distinguish between different classes of packets, e.g. 
real-time voice transmission packets vs. e-mail packets. 

Another solution intermediate between CP and CS is known as sharing with 
minimum allocation (SMA). In SMA, a minimum amount of buffer space is always 
reserved for each queue. In principle, the amount reserved for each queue may differ, 
but in practice an equal amoimt of space is reserved for each queue. The minimum 
amount Rq reserved for queue q is set as less than M/N (the reservation equivalent to 
each queue under a CP policy in which all queues receive equal allocations) and 
greater than 0 (the reservation at which the policy is equivalent to CS). More 
generally, all queues may be given different non-zero minimum allocations. In 
mathematical terms, this can be expressed as adherence to conditions (4) and (4'): 

^max(Lq,Rq)^M (4) 

Rq>0:q-1...N (4^ 

wherein max(Lq,Rq) refers to the larger of the two values Lq and Rq. 

A third solution intermediate between CP and CS, known as sharing with 
maximum queue and minimum allocation (SMQMA), combines both SMXQ and 
SMA: a minimum amount of space is allocated for each output queue, but a maximum 
queue length is also imposed. The guarantee that each output line has a minimum 
amount of space allocated serves as a simple and convenient method for ensuring that 
high-priority traffic is allocated buffer space, even in a shared buffer environment. 
An example of a commercially available shared memory switch employing an 
SMQMA buffer management policy is the Cisco Lightstream 1010 switch, in which 
every output line has a fixed reservation for high priority continuous bit rate traffic. 
The remainder of the buffer space is used in accordance with an SMXQ policy. 
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SMXQ, SMA and SMQMA all involve static thresholds: they were developed 
under assumptions of static environments where traffic loads may be imbalanced (i.e. 
the queues of some output lines may be more heavily loaded than the queues of other 
output lines) but the overall traffic loads do not change with time. Such assumptions 
do not reflect the reality of most computer networks, in which overall traffic loads 
vary significantly over time. This leads to degradation in the performance of shared 
memory switches when the actual traffic conditions encountered in the network 
deviate from the nominal values that the switches were designed to accommodate. 
This has led to attempts to develop buffer management policies which can adapt to 
changes in network traffic conditions. 

One such type of policy has been termed ''adaptive control (see Thareja & 
Agrawala, IEEE Trans. Commun., vol. COM-32, no. 6, June 1984, pp,737- 740). In 
this type of policy, the allocation for all output lines is revised in response to changes 
in traffic conditions. Traffic estimations are made by statistical measurements of 
incoming traffic, and these are compared to the conditions which existed at the last 
allocation update. The policy can thus determine when the allocation should next be 
revised and what the new allocation should be. While the general idea of an adaptive 
control policy appears useful, in practice it has been difficult to devise particular 
buffer management policies along these lines. For example, the statistical estimation 
of network traffic, which it was once thought would prove to be a trivial matter, has 
proven to be quite complicated. Also, it is much more difficult to determine an 
optimal allocation for N outputs than for two outputs, as was done by Thareja & 
Agrawala. Thus adaptive control policies have not been widely adopted. 

Another type of policy is known as dynamic threshold (DT) (described in 
Choudhury et al., IEEE/ACM Trans. Commun. 6(2) April 1998, pp. 130-140). In DT, 
the queue length thresholds of the output lines are set as proportional to the current 
amount of unused buffer memory. In mathematical terms, 

Lq < aF (5) 
for all q from 1 to N, wherein a is a proportionality constant and F is defined as 
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F = M-J^U (6) 

Packets or cells arriving for a given output line are rejected if the length of the 
queue for that output line exceeds the current threshold value, i.e. if condition (5) will 
be violated by acceptance of the incoming packet for queue q. During periods of 
overall light traffic, this enables some output queues experiencing relatively high 
activity to utilize most of the buffer space. As overall traffic flow increases and the 
demand for memory for other queues increases, the queue length threshold decreases, 
thus forcing heavily loaded queues to unload packets before said heavily loaded 
queues will be allowed to accept more packets, and enabling previously lightly loaded 
queues to fill up. 

The principal difficulty with the dynamic threshold policy is that the 
proportionality constant a is set at some value and remains at this value throughout 
the use of the shared memory switch. However, computer modeling and simulation 
has shown that while proportionality constants over a range of values produce results 
which are better than pure CP or pure CS (or, in general, even better than SMXQ), the 
performance of the DT policy under different conditions differs with the value of the 
proportionality constant. Consequently, no single proportionality constant can be 
chosen which performs optimally over all network traffic conditions. 

There is thus a need for a buffer management policy which can better optimize 
the use of the buffer space in a shared memory switch than buffer management 
poUcies which are presently known. 
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Siunmary of the Invention 

The present invention provides a method for managing the buffer allocation in 
shared memory devices. In a preferred embodiment of the invention, the device is a 
shared memory switch. This buffer allocation management method may be used to 
implement a known buffer management policy, such as SMXQ or, in a preferred 
embodiment of the invention, it may be used to implement a novel buffer 
management policy, which the inventors call the "harmonic" buffer management 
policy, which is also contemplated within the scope of the invention. 

In its simplest embodiment, the present invention comprises sorting the queues 
in a buffer according to queue length to obtain a sorted order of the queues, selecting 
a group consisting of k queues in accordance with the sorted order, and allocating a 
portion of the space in the buffer to the group as unit. In a preferred embodiment of 
the invention, the total buffer space allocated to the selected group of queues 
constitutes a maximum amount of space available to the selected group of queues as a 
unit. This maximum amount of buffer space allotted to the k queues as a unit may be, 
and in a preferred embodiment of the invention is, disproportionately greater than the 
total maximum space that the k queues would collectively receive if a CP policy was 
implemented with respect to all queues in the buffer under which each queue received 
an equal allocation. In another preferred embodiment of the invention, a minimum 
amount of buffer space is reserved for the k selected queues as a group, with the 
provision that a minimum amount of buffer space is reserved for the remaining 
queues as well, wherein the total minimum amount of space reserved for the k 
selected queues is largest total minimum amount of space reserved for any group of k 
queues. 

In accordance with a preferred embodiment of the invention, the allocation of 
space between the k queues of the selected group themselves is achieved by setting a 
set of conditions which must be adhered to in order for incoming packets destined for 
one of the k queues to be accepted. This set of conditions may be chosen so that the 
allocation between the queues of the group is equivalent to the implementation of a 
known buffer management policy with respect to the queues of the group, as will be 
explained in more detail hereinbelow. In a preferred embodiment of the invention, 
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the set of conditions is chosen so that the allocation of buffer space between the 
queues of the group is done in accordance with the static or dynamic "harmonic" 
buffer management policy of the present invention, as will be explained in more detail 
hereinbelow. 

Because the first step of the present method involves sorting and ordering the 
queues according to queue length, in principle all the queues in a buffer may be 
grouped into groups according to size. Each group of queues may contain an equal 
number of queues, e.g. in a buffer having 100 queues the queues may be divided into 
10 groups of 10 queues each (group 1 containing the 10 largest queues, group 2 
containing the 10 next largest queues, etc.) Alternatively, the groups of queues may 
contain different numbers of queues, e.g. in a buffer containing 100 queues the queues 
may be grouped into 4 groups, with the ten largest queues constituting group 1 , the 20 
next largest queues constituting group 2, the next 30 largest queues constituting group 
3 and the remaining 40 queues constituting group 4. Conceptually, this may be 
thought of as selecting a group of 60 queues which is then sub-divided into 3 groups 
of 10, 20 and 30 queues respectively, with a fourth non-selected group of 40 queues 
remaining, or it may be conceived of as grouping the sorted queues of the buffer into 
four groups a priori. Irrespective of how the groupings are conceptualized, the 
conditions imposed for acceptance of a packet into one group of queues (i.e. the 
system of equations and inequalities which must be adhered to) need not be the same 
as the conditions imposed for acceptance of a packet into a different group of queues. 
Thus, the buffer space within each group of queues may be allocated in accordance 
with different criteria. 

Preferably, the sorted order of the queues of the buffer is determined at regular 
intervals, either immediately prior to or immediately after packets are released from 
the queues. 

In accordance with a preferred embodiment of the invention, there is provided 
a method for allocating space among a plurality of queues in a buffer, comprising: 

sorting all the queues of the buffer according to size, thereby to establish a 
sorted order of the queues; 



10 



41682s4.cloc 



41682S4 



selecting at least one group of the queues consisting of a given number of the 
queues in accordance with said sorted order; 

allocating a portion of the space in the buffer to said at least one group, 
responsive to the number of the queues in said at least one group; and 

accepting a data packet into one of the queues in said group responsive to 
whether the data packet will cause the space occupied in the buffer by the queues in 
the group to exceed the allocated portion of the space. 

In a preferred embodiment of the invention, the queues of said at least one 
group of queues are largest among said plurality of queues. 

In a preferred embodiment of the invention, the step of allocating the portion 
of the space comprises setting a size of the space proportional to a sum of a harmonic 
series. 

In a preferred embodiment of the invention, the setting of the size of the space 
comprises establishing a total maximum buffer size Bk for the k largest output queues, 
wherein Bk is substantially given by 



wherein M is the total buffer space available to all queues, N is the number of 
output queues, and i is an integer index. 

In one preferred embodiment of the invention, the selecting of the number k of 
the queues comprises selecting a fixed number of the queues. 

In another preferred embodiment of the invention, k has a variable value. 

In a preferred embodiment of the invention, k and Bn are set in accordance 
with the equation: 



Bk- 



M ^1 



Bk = (n + 



m 



)• 



M 



c 



logcN 
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wherein 1 < c < N, and n is the largest integer such that m and k satisfy the conditions 



and 

n 

i=l 

In a preferred embodiment of the invention, buffer space is allocated within 
said at least one group of queues in accordance with a set of conditions which 
functionally define a known buffer management policy. In a preferred embodiment of 
the invention, the known buffer management policy is chosen from a group consisting 
of complete partitioning (CP), sharing with maximum queue lengths (SMXQ), sharing 
with minimum allocation (SMA), sharing with maximum queue length and minimxmi 
allocation (SMQMA), and dynamic threshold (DT). 

In a preferred embodiment of the invention, the given number is fixed. In 
another preferred embodiment of the invention, the given number and the portion of 
the space in the buffer are variable. 

There is also provided, in accordance with a preferred embodiment of the 
invention, a method for allocating space among N output queues in a buffer of size M, 
which comprises: 

sorting all the queues of the buffer according to size, thereby to establish a 
sorted order of the queues; 

selecting a number k of said N output queues in accordance with the sorted 

order; 

establishing a total maximum bixffer space of Bk for said number k of the N 
output queues, wherein < M; 

ascertaining whether acceptance of an arriving packet destined for one of said 
number k of the N output queues will cause the space in the buffer used by said k 
queues together to exceed Bk; and 
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if the acceptance of an arriving packet destined for one of the k queues will 
cause the space used by the k queues together to exceed Bk, rejecting said packet. 

In a preferred embodiment of the invention, the selecting of a nimiber k of said 
N output queues comprises selecting a number k of said N output queues that are 
largest among said N output queues. 

In a preferred embodiment of the invention, the establishing of the total 
maximum buffer space available for the k queues comprises setting the maximum 
buffer size for the k queues such that Bk is substantially given by 

M ^1 

Bk- V- 

(lnN + l)frfi 

wherein M is the total buffer space available to all queues, and N is the number of 
output queues, and i is an integer index which refers to the ordinal ranking of the 
queue length. 

In a preferred embodiment of the invention, buffer space is allocated within 
said number k of the N output queues in accordance with a set of conditions which 
functionally define a known buffer management policy. In a preferred embodiment of 
the invention, the known buffer management policy is chosen from a group consisting 
of complete partitioning (CP), sharing with maximum queue lengths (SMXQ), sharing 
with minimum allocation (SMA), sharing with maximum queue length and minimum 
allocation (SMQMA), and dynamic threshold (DT). 

In one preferred embodiment of the invention, the selecting of the number k of 
the queues comprises selecting a fixed number of the queues. 

In another preferred embodiment of the invention, the value of k is variable. 

In a preferred embodiment of the invention, the values of k and Bn are set in 
accordance with the equations 
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0 < m < c' 



and 



n 



m. 



1=1 



There is also provided in accordance with a preferred embodiment of the 
invention a shared memory switch comprising: 

a memory providing buffer space of size M, which is adapted to contain a 
pluraUty of output queues; and 

a controller, coupled to sort all the queues of the buffer according to size, 
thereby to establish a sorted order of the queues, and to allocate the space in the buffer 
to the output queues in accordance with the sorted order such that a portion of the 
space is allocated to a group of the queues consisting of a given number of the queues 
that are largest among the plurality of the queues responsive to the given number of 
the queues in the group. 

In a preferred embodiment of the invention, the portion of the space within the 
memory is allocated by setting a size of the space proportional to a sum of a harmonic 
series. 

In a preferred embodiment of the invention, the portion of the space within the 
memory is allocated for the k largest output queues and the total maximum buffer size 
Bk for said k largest output queues is substantially given by 



k 



M 1 
(lnN) + li 
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wherein i is an integer index which refers to the to ordinal ranking of the 
queue lengths. 

In a preferred embodiment of the invention, the portion of the space is 
allocated for the k largest output queues and the total maximum buffer size Bk for the 
k largest output queues is substantially given by the equation 

^ ^ m . M 

Bk = (n + — -) • 

c""^ logoN 

wherein 1 < c < N, and n is the largest integer such that m and k satisfy the conditions 



0<m<c"^^ 



and 



k = ^c' + m. 



In a preferred embodiment of the invention, the switch has N output lines and 
N corresponding output queues, wherein the given number is denoted by k and said 
portion of the space available for the k largest output queues is denoted by Bk, 
wherein Bk < M. 

In a preferred embodiment of the invention, Bk is substantially given by the 
equation 

Bk = V- 

(lnN + l)frti 

wherein i is an integer index which refers to the ordinal ranking of the queue in terms 
of queue length. 
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Brief Description of the Drawings 

The present invention will be better understood in connection with the 
drawings, in which: 

Fig. 1 is a schematic diagram of a typical shared memory switch known in the 
art; and 

Fig. 2 is a flow chart illustrating implementation of a buffer management 
policy for determining whether to accept or reject an incoming packet, in accordance 
with a preferred embodiment of the present invention. 
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Detailed Description of Preferred Embodiments of the Invention 

Reference is now made to Fig. 2. Fig. 2 shows schematically the operation of 
a buffer management policy in accordance with a preferred embodiment of the present 
invention. This policy is preferably implemented in a shared memory switch, such as 
that shown in Fig. 1, having N output lines. Alternatively, the policy may be adapted, 
mutatis mutandis, for use in managing input and output buffers in switches of other 
types, as well as for managing queues in buffers of other types. 

As shown at step 22 in Fig. 2, the N queues of the buffer are first sorted by 
queue length. In the following discussion, it will be assumed that the sorting will be 
done in descending order, i.e. from longest to shortest, but it will readily be 
appreciated that in the practice of the invention, the queues may be sorted in 
ascending order. 

As shown at step 23 in Fig. 2, the queues are then grouped into groups 
according to their sorted order. In the simplest case, depicted schematically in Fig. 2, 
the queues are divided into 2 groups, one group containing the k largest queues and 
the other group containing the remaining queues. However, it will be appreciated that 
in the practice of the invention, the queues may be grouped into a plurality of groups 
of queues, e.g. a first group containing the k largest queues, a second group containing 
the / next largest queues, a third group containing the m next largest queues, etc. 

As shown at step 24 in Fig. 2, criteria for the acceptance of a packet into one 
of the k longest queues are established. The criteria are set forth in the form of a set 
of equalities and/or inequalities which must be fulfilled in order for an incoming 
packet to be accepted into the group of the k longest queues. Similarly, criteria for 
acceptance of a packet into the group of the remaining queues are also established in 
the form of a set of equalities and/or inequalities which must be fulfilled in order for 
an incoming packet to be accepted into the group of the k longest queues. If the 
ordered queues are grouped into a plurality of groups, acceptance criteria for packets 
destined for each group are established in step 24. 
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It will be appreciated that although as shown in Fig. 2, step 24 follows steps 
22 and 23, in practice the criteria for acceptance of an incoming packet, i.e. the 
system of equalities and inequalities which must be adhered to, may be set prior to the 
sorting or grouping of queues according to queue length, and may in fact be set as 
permanently fixed a priori. 

The set of equalities and/or inequalities which constitute the acceptance 
criteria for an arriving packet may be the set of equalities and/or inequalities which 
effectively implement a known buffer management policy, such as CP, as explained 
above. With respect to the group of the k largest queues in particular, in a preferred 
embodiment of the invention the k largest queues are collectively allocated a 
maximum buffer space of Bk, where Bk < M, the total memory available in the buffer. 

The maximum buffer space Bk allocated for the k largest queues may be set as 
a fixed proportion of the total buffer memory M, e.g. (k/N) x M or M/2. In 
accordance with a preferred embodiment of the present invention, the total maximum 
buffer allocation Bk for the k queues of the group is set in accordance with the 
equation 

Bk = -Vt (7) 

ln(N + l)trYi 

where i is an integer index which indexes the k queues of the group, i.e. i refers to the 
ranked order of the queue in terms of queue length. Because this equation is based on 
the harmonic series 1/i, the buffer allocation policy based on (7) will be termed 
"Harmonic" throughout the remainder of this description. 

It will be appreciated by those skilled in the art of buffer management that since the 
queues have been sequentially organized on the basis of queue length, the Harmonic 
policy allocates a relatively large amount of buffer space for individual bursts of 
packets (up to M/((ln N) + 1) packets for one queue), but reserves buffer space for 
idle queues that may become active. This enables such idle queues to begin service 
immediately when called upon to do so. Furthermore, among the active queues, a 
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degree of fairness is achieved: if there are exactly k active queues, then, if no 
conditions other than those of the Harmonic policy are imposed, the queue lengths of 
the k active queues converge to Bk/k packets per queue. 

As shown at steps 26 and 28 in Fig, 2, a packet arriving for one of the k largest 
queues will be rejected if acceptance of the packet would cause the memory utilized 
by the k queues as a group to violate the acceptance criteria for that group of queues. 
Thus, for example, when the acceptance criterion for the k largest queues is the 
condition that the buffer space collectively occupied by the k largest queues not 
exceed Bk, an incoming packet destined for one of the k largest queues will be 
rejected if its acceptance would cause the space occupied by the k queues to exceed 
Bk- For example, if Bk is set in accordance with equation (7), then the packet will be 
rejected if acceptance of the packet would violate condition (8): 

|]Li<Bk (8) 

q = i 

wherein i is an integer index for the k queues of the group. 

It will be noted that adherence to condition (8) is a sufficient condition for 
adherence to the conditions of complete sharing within the k queues of the selected 
group. 

As shown at step 28 in Fig. 2, if the cumulative threshold allocation Br for the 
k queues of the group will not be exceeded by acceptance of the incoming packet, and 
there is space in the buffer to accept the incoming packet, then the packet will be 
accepted. 

As stated, the acceptance criteria for queues of either of the two groups of 
queues depicted in Fig. 2, or more generally for any of a plurality of groups of queues 
organized on the basis of queue length, may be chosen so as to effectively implement 
a known buffer management policy. Thus, for example, within the k queues of the 
group, the following policies could effectively be adopted, by choosing an appropriate 
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system of equalities and/or inequalities to form the acceptance criteria for the group of 
k queues: 

• A straight CS policy may be followed. As explained above, this is the 
default setting when no conditions beyond condition (8) are imposed on the 
acceptance or rejection of an incoming packet. 

• A strict CP-type allocation may be used, wherein each of the k queues of 
the group receives Bk/k memory (or wherein each of the remaining queues 
receives (M-Bk)/(N-k) memory). 

• SMXQ may be used, either with each of the k queues receiving the same 
maximum allocation or with each of the k queues receiving a different 
maximum allocation, as explained in the preceding paragraph, 

• An SMA-type policy or an SMQMA-type policy may be used, 

• The k queues of the group may also be made to follow a DT policy. 

Thus, for example, if a buffer management policy in accordance with the present 
invention is employed in which the acceptance criteria for an incoming packet are 
those equations and inequalities which define an SMXQ poUcy, then the incoming 
packet destined for one of the k queues of the group will be accepted only if such 
acceptance will not cause violation of either condition (1) above or condition (9): 

Bic<XTi (9) 

As shown at steps 26 and 30 in Fig. 2, if the incoming packet is not destined 
for one of the k largest queues, then the acceptance or rejection of the packet will 
depend on whether acceptance of the incoming packet will violate the terms of the 
buffer management policy implemented with respect to the N-k remaining queues. 

As stated above, the present method may be practiced by grouping the sorted 
queues into a plurality of groups on the basis of queue lengths. In the case where the 
allocation for the k largest queues is set as a maximum allocation Bk for the group of 
queues as a unit, this may conceptualized as sub-dividing the collective allocation Bk 
into allocations for sub-groups of queues within the k queues of the group, wherein 
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the sub-groups are also grouped on the basis of queue size. Thus, for example, 
maximum buffer space allocations for two groups of queues, one consisting of j 
queues (j < k), the other of k-j queues, may be made in accordance with the general 
equation 

Bk-j + Bj-Bk. 

In principle, the number of subgroups of queues may be increased, up to a total of k 
such subgroups (one queue per sub-group), although in the latter case, the buffer 
management policy would in effect be reduced to a CP policy. 

Thus, for example, for a shared memory switch having N outputs, the j largest 
queues could be allocated a maximum of Bj buffer space: 



M 



i- (10) 



(lnN + l)^i 

where] < k, and the next k-j largest queues could then be allocated Bk-j space, i.e. 



IVT ^1 

Bk-j = B.-Bi:=-— — £^ (11). 

(lnN + l)trj 1 



It will be appreciated that, as within the k queues of the group as a whole, 
sharing within each subgroup may be governed according to a CS policy (default in 
the absence of specification of acceptance criteria) or according to some other set of 
equalities and inequalities compatible with a non-preemptive buffer management 
scheme. 

As stated, the ability of buffer management policies in accordance with 
preferred embodiments of the invention to allow for the sub-grouping of queues 
means that, at the extreme, individual queues can be defined as a sub-group, e.g. for 
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each of the j largest queues (up to j = k), the maximum queue length for each queue 
can be set as 

B.==— (12) 
(liiN + l)i 

This is actually a special case of a CP policy within the j largest queues, 
wherein the allocation for each queue is set in accordance with equation (12). Under 
such a policy, an incoming packet destined for the i'^' largest queue (out of the j 
queues for which Bq is defined as in (12)) will then be rejected if its acceptance will 
violate the condition 

Li<Bi (13) 

i.e., if more than i queues would each occupy more than B; buffer space as a result of 
acceptance of the packet. 

The allocation of space within the other k-j queues of the group may similarly 
be effected in accordance with chosen acceptance criteria, which, if desired, may 
effectively implement known buffer management policies. 

Similarly, the allocation of space in the remaining N-k queues in the buffer 
may be allocated in accordance with chosen acceptance criteria, which, if desired, 
may effectively implement known buffer management poUcies. The acceptance 
criteria used in the different groups of queues need not be the same. 

Because, in accordance with the present invention allocation is determined on 
the ranked order of the sizes of the queues, the present method enables the shared 
memory switch to naturally adapt to the burstiness inherent in a network: when a 
queue of previously moderate activity suddenly becomes active, its allocation is 
allowed to increase, but not to increase so much that it monopolizes the available 
buffer space. 
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In some preferred embodiments of the invention, k is jfixed. In other preferred 
embodiments of the invention, k, and thus Bt, is changed dynamically, preferably in 
accordance with the equation: 

Bk-(n + — )•— — (14) 
c logcN 

wherein 1 < c < N, and n is the largest integer such that m and k satisfy the conditions 
0<m<c"^^ (15a) 

and 

k = 2]c' + m (15b) 

Buffer management policies based equations 14, 15(a) and 15(b) will be referred to 
hereinafter as "Parametric Harmonic" policies. It will be appreciate that a c value 
close to 1 approximates complete sharing between the queues of the group, and a c 
value close to N approximates complete partitioning between the queues of the group. 
The parametric harmonic policy divides the queues into logcN blocks each of length c* 
(wherein = 1,2, ... logcN). Each block is allocated about M/logcN memory slots. In 
accordance with the Parametric Harmonic policy, 

k 

2]Li<Bkfork-L..N (15c) 

The inventors have subjected a Harmonic buffer management policy in 
accordance with the present invention to computer simulations of network traffic 
conditions, which have shown that the Harmonic buffer management policy achieves 
high throughput and easily adapts to changing load conditions. In comparison to DT 
and SQXM policies over a variety of conditions, Harmonic buffer management tends 
to track the better performer of these two policies under all conditions. 
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It will be appreciated by persons skilled in the art that the present invention is 
not limited by what has been particularly shown and described hereinabove. Rather 
the scope of the present invention includes both combinations and subcombinations of 
the features described hereinabove as well as modifications and variations thereof 
which would occur to a person of skill in the art upon reading the foregoing 
description and which are not in the prior art 
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